A tracial quantum central limit theorem

We prove a central limit theorem for non-commutative random variables in a
von Neumann algebra with a tracial state: Any non-commutative polynomial of
averages of i.i.d. samples converges to a classical limit. The proof is based
on a central limit theorem for ordered joint distributions together with a
commutator estimate related to the Baker-Campbell-Hausdorff expansion. The
result can be considered a generalization of Johansson's theorem on the
limiting distribution of the shape of a random word in a fixed alphabet as
its length goes to infinity.

1. INTRODUCTION

One of the most important results in mathematics for science is the central
limit theorem. But there is only an incomplete theory of central limits in
the setting of quantum probability theory, even though the quantum rules of
probability describe physical reality just as the classical rules do. In this
paper we will prove a quantum central limit theorem for tracial states. Our
result is not as sharp or as general as what one may conjecture, but it is
sharper than previous central limit theorems in the same setting [?].

The difference between quantum and classical central lim- 
its only appears in the multivariate case. In quantum proba- 
bility theory, a real-valued random variable is expressed by 
a self-adjoint operator in a von Neumann algebra (see be- 
low). Quantum behavior only arises with non-commuting op- 
erators. In particular independent variables do commute, so 
the proof of the classical central limit theorem applies to in- 
dependent, identically distributed (i.i.d.) real-valued quantum 
random variables. 

The classical theorem does not apply to i.i.d. samples of a vector of random
variables with non-commuting components. For example, let $A$ and $B$ be two
non-commuting measurements in a quantum probability space $\mathcal{M}$ and
let $\tilde{A}$ and $\tilde{B}$ be the corresponding averaged measurements in
the larger space $\mathcal{M}^{\otimes N}$ consisting of $N$ independent
copies of $\mathcal{M}$. We would like to say that the joint distribution of
$\tilde{A}$ and $\tilde{B}$ converges to a Gaussian distribution. But since
$\tilde{A}$ and $\tilde{B}$ are not simultaneously measurable, it is not
clear what this means. One physically observable interpretation is that any
suitable self-adjoint expression, such as the anti-commutator
$\tilde{A}\tilde{B} + \tilde{B}\tilde{A}$, should approach a classical
distribution.

Our result follows this interpretation. To state it precisely we introduce
some notation. Let $(\mathcal{M},\rho)$ be a quantum probability space
consisting of a von Neumann algebra $\mathcal{M}$ and a normal state $\rho$.
Then $(\mathcal{M}^{\otimes N},\rho^{\otimes N})$ denotes $N$ independent
copies of $(\mathcal{M},\rho)$. If $A \in \mathcal{M}$, then

$$A^{(k)} = I^{\otimes (k-1)} \otimes A \otimes I^{\otimes (N-k)}$$

denotes the $k$th copy of $A$ in $\mathcal{M}^{\otimes N}$, while

$$\tilde{A} = \frac{A^{(1)} + A^{(2)} + \cdots + A^{(N)}}{\sqrt{N}}$$

is the rescaled sum of $N$ independent samples of $A$. Finally if $A$ is a
quantum random variable, $\mathcal{L}[A]$ denotes its distribution, or
spectral measure.

Theorem 1. Let $(\mathcal{M},\rho)$ be a quantum probability space with a
tracial state $\rho$, and let $A_1, A_2, \ldots, A_a$ be self-adjoint
elements with mean 0. Let
$p \in \mathbb{C}\langle A_1, A_2, \ldots, A_a \rangle$ be a self-adjoint
non-commutative polynomial in $a$ variables. Then

$$\lim_{N\to\infty} \mathcal{L}[p(\tilde{A}_1, \tilde{A}_2, \ldots, \tilde{A}_a)] = \mathcal{L}[p(X_1, X_2, \ldots, X_a)],$$

where $X_1, X_2, \ldots, X_a$ are classical Gaussian random variables with
covariance matrix

$$E[X_j X_k] = \rho(A_j A_k).$$

We briefly review the terminology and notation in Theorem 1 and the
discussion above [?]. A von Neumann algebra $\mathcal{M}$ is an algebra of
bounded operators on some Hilbert space which is both $*$-closed and closed
in the strong (equivalently weak) operator topology. A state $\rho$ is a
bounded, $*$-invariant functional on $\mathcal{M}$ which is non-negative on
positive elements of $\mathcal{M}$; it is called normal if it is continuous
with respect to the strong (equivalently weak) operator topology. A state
$\rho$ is tracial if

$$\rho(AB) = \rho(BA)$$

for all $A, B \in \mathcal{M}$. If $A \in \mathcal{M}$ is self-adjoint, then
any normal state $\rho$ on $\mathcal{M}$ induces a measure on the spectrum of
$A$, the (scalar-valued) spectral measure on $A$. Thus we can interpret $A$
as a real-valued random variable; this is the Copenhagen or Born
interpretation in physics. If $\mathcal{M}$ is commutative, then it is
analogous to a $\sigma$-algebra, while $\rho$ is analogous to a measure on
the algebra.

In physics terminology, $\mathcal{M}$ is a suitably complete calculus of
operators on a quantum system. The state $\rho$ then corresponds to a density
operator or an ensemble, although the mathematical convention is to express
it as a linear functional on operators rather than as an operator. I.e.,
$\rho(A)$ denotes the expectation or mean of $A$ with respect to $\rho$,
often written $\mathrm{Tr}(\rho A)$ in physics.

The ring $\mathbb{C}\langle A_1, \ldots, A_a \rangle$ of non-commutative
polynomials admits a unique $*$-involution which fixes each $A_n$ and which,
as usual, is anti-linear and an algebra anti-automorphism. A polynomial $p$
is self-adjoint if $p^* = p$. For example, the commutator $[A_1, A_2]$ is
anti-self-adjoint, but $i[A_1, A_2]$ is self-adjoint.

Readers who aren't interested in general von Neumann algebras can consider
the special case $\mathcal{M} = M_k$, the $k \times k$ matrices, and

$$\rho(A) = \frac{1}{k}\,\mathrm{Tr}(A).$$

In this case each $A_n$ is a $k \times k$ Hermitian matrix. We interpret a
Hermitian matrix $A$ as a random variable by the formula

$$P[A = \lambda] = \frac{m}{k}$$

if $\lambda$ is an eigenvalue of $A$ with multiplicity $m$. This special case
captures the difficulties of the general case. We refer readers to Nielsen
and Chuang [11] or Sakurai [13] for introductions to quantum probability
theory.
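As a small illustration of the matrix special case, the following sketch computes the exact spectral distribution of the rescaled sum $\tilde{A}$ when $A = \mathrm{diag}(1,-1)$ and $\rho$ is the normalized trace on $2 \times 2$ matrices. The choice of $A$ and the helper names `sum_spectrum` and `moment` are assumptions made for this example, not part of the paper. Because all copies $A^{(k)}$ are simultaneously diagonal, the eigenvalue $N - 2j$ of the unrescaled sum has multiplicity $\binom{N}{j}$, so the formula $P[A = \lambda] = m/k$ gives a binomial law, which is the one-variable (commutative) case of the central limit theorem.

```python
from fractions import Fraction
from math import comb

def sum_spectrum(N):
    """Spectral distribution of A^(1) + ... + A^(N) for A = diag(1, -1).

    Under the normalized trace on 2^N x 2^N matrices, the eigenvalue N - 2j
    has multiplicity comb(N, j), hence probability comb(N, j) / 2**N.
    """
    return {N - 2 * j: Fraction(comb(N, j), 2 ** N) for j in range(N + 1)}

def moment(N, r):
    """Exact r-th moment (r even) of the rescaled sum (A^(1)+...+A^(N)) / sqrt(N)."""
    raw = sum(p * s ** r for s, p in sum_spectrum(N).items())
    return raw / Fraction(N) ** (r // 2)

for N in (1, 4, 16, 64):
    # The variance stays equal to rho(A^2) = 1; the fourth moment tends to
    # the Gaussian value 3 as N grows.
    print(N, moment(N, 2), float(moment(N, 4)))
```

The second moment is exactly $\rho(A^2) = 1$ for every $N$, and the fourth moment equals $3 - 2/N$, which approaches the fourth moment of a standard Gaussian.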

With these definitions we can discuss the limitations of Theorem 1. Possibly
the most serious one is that $\rho$ must be tracial. We previously
conjectured Theorem 1 without assuming that $\rho$ is tracial, only that the
covariance matrix of $A_1, A_2, \ldots, A_a$ is symmetric [10]. Goderis,
Verbeure, and Vets found a quantum central limit for any state $\rho$, namely
a quasi-free state (or in physics terminology, a product of thermal states of
harmonic oscillators) [?]. Thus we conjecture that in Theorem 1 we can let
$\rho$ be any normal state if we replace "classical Gaussian random
variables" by "quasi-free variables".

Of course Theorem 1 applies to the case where $\rho$ does not symmetrize all
of $\mathcal{M}$, only the subalgebra generated by $A_1, \ldots, A_a$, since
we can replace $\mathcal{M}$ by this subalgebra. In physics language it
suffices for $A_1, \ldots, A_a$ to commute with the density matrix. This
occurs, for example, if $\rho$ is a thermal state (also called a Gibbs state
or the Boltzmann distribution) of a Hamiltonian $H$ and each $A_n$ is a
conserved quantity. It also occurs in the infinite-temperature limit of any
system, because in this limit the density matrix approaches the identity.

Another limitation of Theorem 1 is the requirement that each $A_n$ is
bounded. By contrast the classical central limit theorem requires only that
the covariance matrix is finite. Thus a more satisfying version would allow
each $A_n$ to be an unbounded operator affiliated with $\mathcal{M}$,
although with enough restrictions that $p(A_1, \ldots, A_a)$ is still
well-defined. Finally the polynomial ring
$\mathbb{C}\langle A_1, \ldots, A_a \rangle$ could be replaced by some von
Neumann algebra to which $\mathbb{C}\langle A_1, \ldots, A_a \rangle_{sa}$ is
affiliated. This algebra would model measurable non-polynomial expressions in
the variables $A_1, \ldots, A_a$; it would be a non-commutative analogue of
the algebra $L^\infty(\mathbb{R}^a)$ of bounded, measurable functions on
$\mathbb{R}^a$.

Despite limitations, Theorem 1 is useful. For instance, we previously showed
that it implies Johansson's theorem on the limiting distribution of the shape
of a random word in $k$ letters as the length $N$ goes to infinity [7, 10].
Indeed Theorem 1 can be taken as a generalization of Johansson's theorem.



1.1. Some previous results 

Many results in the literature are called or could be called quantum central
limit theorems. Here we discuss a few that apply to a discrete set of
independent, non-commuting random variables, as Theorem 1 does.

Cushen and Hudson proved the quantum central limit theorem for a pair of
conjugate variables $P$ and $Q$, i.e., such that $[P, Q] = iI$ [?].
Quaegebeur later generalized this result to arbitrary CCR algebras [?].

Giri and von Waldenfels proved that the general non-commutative moment
$\rho^{\otimes N}(\tilde{A}_1 \tilde{A}_2 \cdots \tilde{A}_a)$ converges to a
Gaussian or quasi-free value [?]. This can be considered an "algebraic"
quantum central limit theorem because it applies to any $*$-algebra.

Goderis, Verbeure, and Vets established the convergence of
$\rho^{\otimes N}(e^{i\tilde{A}})$ for all $A \in \mathcal{M}$ to the
characteristic functions of a quasi-free state of a universal CCR-algebra
over $\mathcal{M}$ [?].

If we compare these three results and Theorem 1 to each other, none is
eclipsed by the others. As implied above, we conjecture that there is a
mutual generalization of all four results in the von Neumann algebra setting.

Finally Voiculescu and many followers have developed a non-commutative
probability theory in which conventional statistical independence is replaced
by free independence in the sense of non-commutative free products [?]. This
is a very interesting theory which may yet be useful in physics, but the
results in this paper are not part of it.

2. OUTLINE OF THE PROOF

We use $\mathcal{M}_{sa}$ to denote the space of self-adjoint elements of
$\mathcal{M}$. We abbreviate $\vec{A} = (A_1, A_2, \ldots, A_a)$. We also
abbreviate

$$\vec{A}^{\vec{\alpha}} = (A_1^{\alpha_1}, A_2^{\alpha_2}, \ldots, A_a^{\alpha_a}),$$

and we use $\vec{x} \cdot \vec{y}$ to denote the standard inner product on
$\mathbb{R}^a$.

Our proof of Theorem 1 is based on a multivariate generalization of the
spectral measure $\mathcal{L}[A]$ that we call the ordered joint distribution

$$\mathcal{L}[A_1, A_2, \ldots, A_a] = \mathcal{L}[\vec{A}].$$

It can be defined by its Fourier transform

$$\hat{\mathcal{L}}[\vec{A}](\vec{t}) = \rho\bigl(e^{i t_1 A_1} e^{i t_2 A_2} \cdots e^{i t_a A_a}\bigr).$$

The ordered joint distribution is not directly observable, among other
reasons because it allows negative and even non-real "probabilities".

As a first step, an essentially classical central limit theorem holds for
ordered joint distributions. Consequently for any positive exponents
$\alpha_1, \alpha_2, \ldots, \alpha_a$, the distribution

$$\mathcal{L}[\tilde{A}_1^{\alpha_1}, \tilde{A}_2^{\alpha_2}, \ldots, \tilde{A}_a^{\alpha_a}] = \mathcal{L}[\tilde{\vec{A}}^{\vec{\alpha}}]$$

also approaches a classical limit. When $\rho$ is tracial, the difference
between an ordered joint characteristic function,

$$\hat{\mathcal{L}}[\tilde{\vec{A}}^{\vec{\alpha}}](\vec{t}) = \rho^{\otimes N}\bigl(e^{i t_1 \tilde{A}_1^{\alpha_1}} e^{i t_2 \tilde{A}_2^{\alpha_2}} \cdots e^{i t_a \tilde{A}_a^{\alpha_a}}\bigr),$$

and the characteristic function of a linear combination of powers of
$\tilde{A}_1, \tilde{A}_2, \ldots, \tilde{A}_a$,

$$\rho^{\otimes N}\bigl(e^{i(t_1 \tilde{A}_1^{\alpha_1} + t_2 \tilde{A}_2^{\alpha_2} + \cdots + t_a \tilde{A}_a^{\alpha_a})}\bigr) = \rho^{\otimes N}\bigl(e^{i \vec{t} \cdot \tilde{\vec{A}}^{\vec{\alpha}}}\bigr),$$

is bounded by a decaying commutator estimate. Thus the spectral
characteristic functions converge pointwise to a classical limit when
$p(\vec{A})$ is a linear combination of powers. This implies many special
cases of Theorem 1; indeed every case up to a change of variables.



3. THE PROOF 

The proof is more natural in the C*-algebra setting than the von Neumann
algebra setting. Recall that a C*-algebra is a special kind of complex Banach
algebra and that every von Neumann algebra is a C*-algebra [?]. C*-algebras
have states, but they do not have normal states in the absence of weak and
strong operator topologies. By the GNS construction, given any state $\rho$
on a C*-algebra $\mathcal{A}$, there exists a von Neumann algebra
$\mathcal{M} \supseteq \mathcal{A}$ and an extension of $\rho$ which is
normal on $\mathcal{M}$. Henceforth we replace the von Neumann algebra
$\mathcal{M}$ by a C*-algebra $\mathcal{A}$ and drop the inessential
condition that $\rho$ is normal. (This is analogous to considering
probability distributions on topological spaces instead of measure spaces.)

We call a norm $\|\cdot\|_s$ on a C*-algebra $\mathcal{A}$ spectral if it is
$*$-invariant and if

$$\|UA\|_s = \|A\|_s$$

for any unitary $U$. We consider the GNS norm

$$\|A\|_\rho = \sqrt{\rho(A^*A)}$$

on $\mathcal{A}$, which is spectral when $\rho$ is tracial. (If $\rho$ is not
faithful, it is only a semi-norm.) By the Russo-Dye theorem [?],

$$\|AB\|_s \le \min\{\|A\|_s\|B\|,\ \|A\|\,\|B\|_s\}$$

in any spectral norm $\|\cdot\|_s$. In particular

$$\|A\|_s \le \|I\|_s\,\|A\|,$$

so the topology induced by $\|A\|_s$ is at least as coarse as the norm
topology.



If $\mathcal{A}$ happens to be a von Neumann algebra and
$A \in \mathcal{A}_{sa}$ has a discrete spectrum, then its spectral measure
$\mathcal{L}[A]$ is given by the rule

$$P[A = \lambda] = \rho(A_\lambda),$$

where $\lambda$ is an eigenvalue of $A$ and $A_\lambda$ denotes projection
onto its $\lambda$-eigenspace. If
$A_1, A_2, \ldots, A_a \in \mathcal{A}_{sa}$ all have discrete spectra, the
ordered joint distribution $\mathcal{L}[\vec{A}]$ is likewise given by

$$P[\vec{A} = \vec{\lambda}] = \rho\bigl((A_1)_{\lambda_1} (A_2)_{\lambda_2} \cdots (A_a)_{\lambda_a}\bigr).$$

This is generally a complex-valued measure rather than a non-negative
measure. Without the discrete spectrum assumption, $\mathcal{L}[\vec{A}]$
might not strictly be a measure at all. Recall that a probability measure on
$\mathbb{R}^a$ can be regarded as a bounded functional on the space
$C_0(\mathbb{R}^a)$ of continuous functions on $\mathbb{R}^a$ that vanish at
infinity. In this paper we define $\mathcal{L}[\vec{A}]$ as a function on the
space of products

$$f(\vec{x}) = f_1(x_1) f_2(x_2) \cdots f_a(x_a)$$

of decaying, continuous, univariate functions by the rule

$$\mathcal{L}[\vec{A}](f) = \rho\bigl(f_1(A_1) f_2(A_2) \cdots f_a(A_a)\bigr).$$

Since $\rho$ is bounded, $\mathcal{L}[\vec{A}]$ is a bounded functional on
the projective Banach tensor product $C_0(\mathbb{R})^{\otimes a}$ (i.e., the
minimal completion of the algebraic tensor product, or the completion with
respect to the greatest cross norm). In order to be a genuine measure,
$\mathcal{L}[\vec{A}]$ would have to extend from
$C_0(\mathbb{R})^{\otimes a}$ to the much larger Banach space
$C_0(\mathbb{R}^a)$. This is not always possible, although
$\mathcal{L}[\vec{A}]$ might naturally extend to some intermediate Banach
space. The distribution $\mathcal{L}[\vec{A}]$ is also sensitive to
permutations of the variables. But it otherwise reasonably generalizes the
usual joint distribution of classical random variables.
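To see concretely how an ordered joint distribution can take non-real values, here is a minimal sketch using the rule $P[\vec{A} = \vec{\lambda}] = \rho((A_1)_{\lambda_1}(A_2)_{\lambda_2}(A_3)_{\lambda_3})$ for discrete spectra. The choice of the three Pauli matrices with the normalized trace, and all function names, are assumptions made for this illustration rather than an example taken from the paper.

```python
from itertools import product

I2 = [[1, 0], [0, 1]]
SX = [[0, 1], [1, 0]]
SY = [[0, -1j], [1j, 0]]
SZ = [[1, 0], [0, -1]]

def matmul(a, b):
    """Product of two 2 x 2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def projection(pauli, sign):
    """Projection onto the sign-eigenspace of a Pauli matrix: (I + sign*pauli)/2."""
    return [[(I2[i][j] + sign * pauli[i][j]) / 2 for j in range(2)]
            for i in range(2)]

def rho(a):
    """Normalized trace on 2 x 2 matrices (a tracial state)."""
    return (a[0][0] + a[1][1]) / 2

def ordered_joint(signs):
    """rho(P_x P_y P_z) for the chosen eigenvalues of sigma_x, sigma_y, sigma_z."""
    sx, sy, sz = signs
    ordered = matmul(matmul(projection(SX, sx), projection(SY, sy)),
                     projection(SZ, sz))
    return rho(ordered)

weights = {s: ordered_joint(s) for s in product((1, -1), repeat=3)}
for s, w in weights.items():
    print(s, w)
print("total:", sum(weights.values()))
```

Each weight works out to $(1 + i\,s_x s_y s_z)/8$, so the eight "probabilities" are non-real but still sum to 1. With only two projections and a tracial state the weights would be non-negative, which is why three variables are used here.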

Remark. For conjugate variables $P$ and $Q$, the ordered joint distribution
is related to the Wigner distribution. The Fourier transform of the Wigner
distribution is given by

$$\hat{W}(q,p) = \rho\bigl(e^{iPq/2} e^{iQp} e^{iPq/2}\bigr).$$

We can then compare

$$\hat{\mathcal{L}}[P,Q](q,p) = \rho\bigl(e^{iPq} e^{iQp}\bigr) = e^{-ipq/2}\,\hat{W}(q,p),$$
$$\hat{\mathcal{L}}[Q,P](p,q) = \rho\bigl(e^{iQp} e^{iPq}\bigr) = e^{ipq/2}\,\hat{W}(q,p).$$

Thus the Fourier transform of the Wigner distribution is a phase-corrected
form of the Fourier transform of the ordered joint distribution.
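The phase factors in the Remark can be traced to the Baker-Campbell-Hausdorff formula. The following is a formal sketch, assuming the convention $[P,Q] = iI$ and ignoring domain questions for the unbounded operators $P$ and $Q$; the symbols $X$ and $Y$ are shorthand introduced only for this calculation. Since $[X,Y]$ is central, $e^X e^Y = e^{X+Y+[X,Y]/2}$.

```latex
\begin{align*}
X &= iqP, \quad Y = ipQ, \quad [X,Y] = -qp\,[P,Q] = -iqp\,I, \\
e^{iPq}\,e^{iQp} &= e^{-ipq/2}\,e^{i(qP+pQ)}, \\
e^{iQp}\,e^{iPq} &= e^{+ipq/2}\,e^{i(qP+pQ)}, \\
e^{iPq/2}\,e^{iQp}\,e^{iPq/2} &= e^{-ipq/4}\,e^{+ipq/4}\,e^{i(qP+pQ)} = e^{i(qP+pQ)}.
\end{align*}
```

Under these assumptions the symmetric ordering has no net phase, and each of the two one-sided orderings differs from it by a phase of modulus 1.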

Theorem 2. If $\mathcal{A}$ is a C*-algebra with a tracial state $\rho$ and
$A_1, A_2, \ldots, A_a \in \mathcal{A}_{sa}$ with mean zero, then their
ordered joint distribution $\mathcal{L}[\vec{A}]$ obeys the central limit
theorem:

$$\lim_{N\to\infty} \mathcal{L}[\tilde{\vec{A}}] = \mathcal{L}[\vec{X}]$$

weakly as functionals on $C_0(\mathbb{R})^{\otimes a}$, where $\vec{X}$ is
Gaussian with covariance matrix

$$M_{j,k} = E[X_j X_k] = \rho(A_j A_k).$$



Proof. We follow standard proofs of the central limit theorem by the method
of characteristic functions [?], but the argument must be applied carefully
because $\mathcal{L}[\vec{A}]$ is a complex measure. We will argue
convergence of $\mathcal{L}[\tilde{\vec{A}}]$ on four classes of functions in
turn:

1. products of sinusoids, 

2. products of smooth functions with bounded support, 

3. products of continuous functions with bounded support, and

4. products of bounded, continuous functions. 

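Observe that if $B_1, B_2, \ldots, B_a$ are self-adjoint elements in another
C*-algebra $\mathcal{B}$ with a state $\tau$ and

$$A_n^{(1)} = A_n \otimes I \in \mathcal{A} \otimes \mathcal{B}, \qquad B_n^{(2)} = I \otimes B_n \in \mathcal{A} \otimes \mathcal{B},$$

then

$$\mathcal{L}[\vec{A}^{(1)} + \vec{B}^{(2)}] = \mathcal{L}[\vec{A}] * \mathcal{L}[\vec{B}],$$

where the operation $*$ denotes convolution. Thus

$$\mathcal{L}[\tilde{\vec{A}}] = T_{1/\sqrt{N}}\bigl(\mathcal{L}[\vec{A}]^{*N}\bigr), \tag{1}$$

where $T_x$ denotes the operator on functionals induced by rescaling space by
a factor of $x$. Equation (1) implies convergence of characteristic
functions:

$$\hat{\mathcal{L}}[\tilde{\vec{A}}](\vec{t}) = \rho^{\otimes N}\bigl(e^{i t_1 \tilde{A}_1} e^{i t_2 \tilde{A}_2} \cdots e^{i t_a \tilde{A}_a}\bigr) \to e^{-\vec{t} \cdot M\vec{t}/2}$$

locally uniformly in $\vec{t}$.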
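The convergence of characteristic functions can be made explicit by a second-order expansion. The following is a sketch of that standard step, written with $\vec{s} = \vec{t}/\sqrt{N}$ (a symbol introduced here); it uses only that each $A_j$ has mean zero and that $\rho(A_jA_k) = \rho(A_kA_j)$ because $\rho$ is tracial.

```latex
\begin{align*}
\hat{\mathcal{L}}[\tilde{\vec A}](\vec t)
  &= \Bigl(\hat{\mathcal{L}}[\vec A](\vec t/\sqrt{N})\Bigr)^{N}, \\
\hat{\mathcal{L}}[\vec A](\vec s)
  &= \rho\bigl(e^{is_1A_1}e^{is_2A_2}\cdots e^{is_aA_a}\bigr) \\
  &= 1 + i\sum_{j} s_j\,\rho(A_j) - \tfrac12\sum_{j} s_j^2\,\rho(A_j^2)
     - \sum_{j<k} s_js_k\,\rho(A_jA_k) + O(|\vec s|^3) \\
  &= 1 - \tfrac12\,\vec s\cdot M\vec s + O(|\vec s|^3), \\
\Bigl(1 - \tfrac{1}{2N}\,\vec t\cdot M\vec t + O(N^{-3/2})\Bigr)^{N}
  &\longrightarrow e^{-\vec t\cdot M\vec t/2}.
\end{align*}
```

The first line holds because each $e^{it_n\tilde{A}_n}$ is a tensor product of $N$ copies of $e^{it_nA_n/\sqrt{N}}$, so the ordered product factorizes across the $N$ tensor slots. The error term is uniform for $\vec{t}$ in a bounded set, which gives the local uniformity.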
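Suppose that

$$f(\vec{t}) = f_1(t_1) f_2(t_2) \cdots f_a(t_a)$$

is a product of smooth univariate functions with bounded support. Its Fourier
transform $\hat{f}$ decays super-polynomially. Consider the joint expectation

$$\mathcal{L}[\tilde{\vec{A}}](f) = (2\pi)^{-a/2} \int \hat{\mathcal{L}}[\tilde{\vec{A}}](\vec{t})\,\hat{f}(\vec{t})\,d\vec{t}$$

as an integral in Fourier space. For any $R > 0$, the integral converges
inside the box $[-R,R]^a$ because there the integrand converges uniformly in
$\vec{t}$. On the other hand, the integral outside of the box vanishes
uniformly in $N$ as $R \to \infty$ because $\hat{f}$ decays and for all
$\vec{t}$,

$$\bigl|\hat{\mathcal{L}}[\tilde{\vec{A}}](\vec{t})\bigr| = \bigl|\rho^{\otimes N}\bigl(e^{i t_1 \tilde{A}_1} e^{i t_2 \tilde{A}_2} \cdots e^{i t_a \tilde{A}_a}\bigr)\bigr| \le 1.$$

(Both $\rho^{\otimes N}$ and its argument have norm 1.) Thus for these $f$,

$$\mathcal{L}[\tilde{\vec{A}}](f) \to \mathcal{L}[\vec{X}](f).$$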



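Now suppose that $f_1, f_2, \ldots, f_a$ have bounded support but are merely
continuous. By the Weierstrass approximation theorem, for each
$\epsilon > 0$, we can let

$$f_n(t) = f_n^{(1)}(t) + f_n^{(2)}(t),$$

where $f_n^{(1)}(t)$ is smooth and

$$\|f_n^{(2)}\| < \epsilon.$$

Then

$$\mathcal{L}[\tilde{\vec{A}}](f) = \sum_{\sigma \in \{1,2\}^a} \rho^{\otimes N}\bigl(f_1^{(\sigma_1)}(\tilde{A}_1) f_2^{(\sigma_2)}(\tilde{A}_2) \cdots f_a^{(\sigma_a)}(\tilde{A}_a)\bigr).$$

Each term of the sum other than the first one is bounded by $\epsilon$, while
the first term converges to $\mathcal{L}[\vec{X}](f)$ if we take
$\epsilon \to 0$. Thus for continuous $f$ with bounded support,

$$\mathcal{L}[\tilde{\vec{A}}](f) \to \mathcal{L}[\vec{X}](f).$$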

Finally suppose that $f_1, f_2, \ldots, f_a$ are bounded but do not have
bounded support. Without loss of generality we suppose that

$$\|f_n\| \le 1$$

for each $n$. For each $R > 0$, we can choose a continuous partition

$$f_n(t) = f_n^{(1)}(t) + f_n^{(2)}(t),$$

where

$$f_n^{(2)}(t) = \min\{\max\{0, |t| - R\}, 1\}\, f_n(t).$$

Thus

$$f_n(t) = \begin{cases} f_n^{(1)}(t) & |t| \le R \\ f_n^{(2)}(t) & |t| \ge R+1. \end{cases}$$

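By the univariate central limit theorem,

$$\lim_{R\to\infty} \|f_n^{(2)}(\tilde{A}_n)\|_{\rho^{\otimes N}} = 0$$

uniformly in $N$. Let $\epsilon > 0$ and choose $R$ such that for each $n$,

$$\|f_n^{(2)}(\tilde{A}_n)\|_{\rho^{\otimes N}} < \epsilon.$$

Since for each $n$ and each $\sigma \in \{1,2\}$,

$$\|f_n^{(\sigma)}(\tilde{A}_n)\| \le 1,$$

and since $\|\cdot\|_{\rho^{\otimes N}}$ is spectral, each term other than
the first in the expansion

$$\mathcal{L}[\tilde{\vec{A}}](f) = \sum_{\sigma \in \{1,2\}^a} \rho^{\otimes N}\bigl(f_1^{(\sigma_1)}(\tilde{A}_1) f_2^{(\sigma_2)}(\tilde{A}_2) \cdots f_a^{(\sigma_a)}(\tilde{A}_a)\bigr)$$

is bounded by $\epsilon$. (The $\rho^{\otimes N}$-norm of the argument in
each term is bounded by $\epsilon$.) As before, the first term converges to
$\mathcal{L}[\vec{X}](f)$ if we take $\epsilon \to 0$. Thus for all $f$
described in the theorem,

$$\mathcal{L}[\tilde{\vec{A}}](f) \to \mathcal{L}[\vec{X}](f).$$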



□

Lemma 3. If $A, B$ are elements of a C*-algebra $\mathcal{A}$, then

$$e^A e^B - e^{A+B} = \int_0^1 \int_0^t e^{(1-t)(A+B)}\, e^{(t-s)A}\, [A,B]\, e^{sA}\, e^{tB}\, ds\, dt.$$

Proof. Let

$$X = e^{A/n}, \qquad Y = e^{B/n}.$$

By elementary calculation,

$$X^n Y^n - (XY)^n = \sum_{t=1}^{n} \sum_{s=0}^{n-t-1} (XY)^{t-1} X^{n-t-s}\, [X,Y]\, X^s Y^{n-t}. \tag{2}$$

Since

$$\lim_{n\to\infty} X^{tn} = e^{tA}, \qquad \lim_{n\to\infty} Y^{tn} = e^{tB},$$
$$\lim_{n\to\infty} (XY)^{tn} = e^{t(A+B)}, \qquad \lim_{n\to\infty} n^2 [X,Y] = [A,B],$$

equation (2) converges to the statement of the lemma as $n$ goes to infinity.
□

Lemma 4. If $A_1, A_2, \ldots, A_a$ are self-adjoint elements of a C*-algebra
$\mathcal{A}$ and $\|\cdot\|_s$ is a spectral semi-norm, then

$$\bigl\|e^{iA_1} e^{iA_2} \cdots e^{iA_a} - e^{i(A_1 + A_2 + \cdots + A_a)}\bigr\|_s \le \sum_{1 \le j < k \le a} \frac{\|[A_j, A_k]\|_s}{2}.$$

Proof. If $a = 2$, then Lemma 3 implies that

$$\bigl\|e^{iA_1} e^{iA_2} - e^{i(A_1 + A_2)}\bigr\|_s \le \frac{\|[A_1, A_2]\|_s}{2}$$

by taking the norm inside the integral. Note that the integral is defined by
convergence in the algebra norm, but it is equally valid to exchange it with
any other form of convergence which is at least as weak, as convergence in
$\|\cdot\|_s$ necessarily is.

The general case follows by induction. □
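The case $a = 2$ of the commutator estimate can be checked numerically in the smallest non-commutative example. The sketch below is an illustration under stated assumptions, not part of the paper: it takes $A_1 = x\,\sigma_x$ and $A_2 = z\,\sigma_z$ (multiples of Pauli matrices), uses the GNS norm of the normalized trace on $2 \times 2$ matrices as the spectral norm, and relies on the closed form $e^{iH} = \cos r\, I + i\,(\sin r / r)\, H$ for a traceless Hermitian $H$ with $H^2 = r^2 I$. All function names are hypothetical.

```python
import math

def pauli_combo(x, y, z):
    """The Hermitian matrix x*sigma_x + y*sigma_y + z*sigma_z."""
    return [[z, x - 1j * y], [x + 1j * y, -z]]

def exp_i(x, y, z):
    """exp(iH) for H = pauli_combo(x, y, z), using H^2 = r^2 I."""
    r = math.sqrt(x * x + y * y + z * z)
    c = math.cos(r)
    s = math.sin(r) / r if r > 0 else 1.0
    h = pauli_combo(x, y, z)
    return [[c * (i == j) + 1j * s * h[i][j] for j in range(2)] for i in range(2)]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def sub(a, b):
    return [[a[i][j] - b[i][j] for j in range(2)] for i in range(2)]

def gns_norm(a):
    """sqrt(rho(a* a)) for the normalized trace rho, i.e. sqrt(sum |a_ij|^2 / 2)."""
    return math.sqrt(sum(abs(a[i][j]) ** 2 for i in range(2) for j in range(2)) / 2)

def lemma4_sides(x, z):
    """Return (left side, right side) of the a = 2 estimate for A1, A2 above."""
    lhs = gns_norm(sub(matmul(exp_i(x, 0, 0), exp_i(0, 0, z)), exp_i(x, 0, z)))
    a1, a2 = pauli_combo(x, 0, 0), pauli_combo(0, 0, z)
    comm = sub(matmul(a1, a2), matmul(a2, a1))
    return lhs, gns_norm(comm) / 2

for x, z in [(0.1, 0.1), (0.5, 0.5), (1.0, 0.3), (2.0, 3.0)]:
    print((x, z), lemma4_sides(x, z))
```

For small $x$ and $z$ the two sides are nearly equal, consistent with the leading Baker-Campbell-Hausdorff term $\tfrac12[A_1, A_2]$; for large arguments the bound is far from sharp because the left side can never exceed 2.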

Let $\mathbb{C}\{\vec{A}\}$ denote the Lie algebra freely generated by the
symbols $A_1, A_2, \ldots, A_a$. Recall that its universal enveloping algebra
is the ring of non-commutative polynomials in the same variables:

$$U(\mathbb{C}\{\vec{A}\}) = \mathbb{C}\langle \vec{A} \rangle.$$

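Lemma 5. If $X$ and $Y$ commute, $\alpha, \beta \in \mathbb{Z}_{\ge 0}$, and
$q > 1$, then

$$X^\alpha Y^\beta = \sum_{n=0}^{\alpha+\beta} t_n (X + q^n Y)^{\alpha+\beta}$$

for some $t_0, t_1, \ldots, t_{\alpha+\beta} \in \mathbb{R}$.

Proof. Expanding each power by the binomial theorem and comparing the
coefficients of $X^{\alpha+\beta-k} Y^k$, the coefficients
$t_0, t_1, \ldots, t_{\alpha+\beta}$ must satisfy the linear system

$$\sum_{n=0}^{\alpha+\beta} q^{nk}\, t_n = \begin{cases} \binom{\alpha+\beta}{\beta}^{-1} & k = \beta \\ 0 & k \ne \beta \end{cases}$$

for $0 \le k \le \alpha + \beta$. The matrix of this system is a Vandermonde
matrix, invertible when $q > 1$, so the system has a solution. □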
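The Vandermonde argument is constructive, and a short exact computation makes it tangible. The following sketch solves the linear system in rational arithmetic and checks the resulting identity at commuting numbers; the function names, the default $q = 2$, and the test values are assumptions chosen for illustration. It uses the convention that $k$ indexes the power of $Y$, so the right-hand side is non-zero only for $k = \beta$.

```python
from fractions import Fraction
from math import comb

def power_coefficients(alpha, beta, q=2):
    """Solve X^alpha Y^beta = sum_n t_n (X + q^n Y)^(alpha+beta) for the t_n.

    Comparing coefficients of X^(d-k) Y^k with d = alpha + beta gives the
    Vandermonde system  sum_n q^(n k) t_n = [k == beta] / comb(d, beta).
    """
    d = alpha + beta
    rows = [[Fraction(q) ** (n * k) for n in range(d + 1)] for k in range(d + 1)]
    rhs = [Fraction(int(k == beta), comb(d, beta)) for k in range(d + 1)]
    # Gauss-Jordan elimination in exact rational arithmetic.
    for col in range(d + 1):
        pivot = next(r for r in range(col, d + 1) if rows[r][col] != 0)
        rows[col], rows[pivot] = rows[pivot], rows[col]
        rhs[col], rhs[pivot] = rhs[pivot], rhs[col]
        for r in range(d + 1):
            if r != col:
                factor = rows[r][col] / rows[col][col]
                rows[r] = [x - factor * y for x, y in zip(rows[r], rows[col])]
                rhs[r] -= factor * rhs[col]
    return [rhs[k] / rows[k][k] for k in range(d + 1)]

def check_identity(alpha, beta, x, y, q=2):
    """Evaluate both sides of the identity at commuting numbers x and y."""
    t = power_coefficients(alpha, beta, q)
    d = alpha + beta
    lhs = Fraction(x) ** alpha * Fraction(y) ** beta
    rhs = sum(t[n] * (Fraction(x) + Fraction(q) ** n * Fraction(y)) ** d
              for n in range(d + 1))
    return lhs, rhs

print(power_coefficients(1, 1))
print(check_identity(2, 1, 3, 5))
```

Since the coefficients depend only on $\alpha$, $\beta$ and $q$, the same $t_n$ work for any commuting $X$ and $Y$, which is what the proof of Lemma 6 needs inside the symmetric algebra.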



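Lemma 6. Any $p \in \mathbb{C}\langle \vec{A} \rangle$ can be expressed as a
linear combination of powers

$$p(\vec{A}) = t_1 B_1^{\beta_1} + t_2 B_2^{\beta_2} + \cdots + t_b B_b^{\beta_b} = \vec{t} \cdot \vec{B}^{\vec{\beta}}$$

with each $B_n \in \mathbb{C}\{A_1, A_2, \ldots, A_a\}$ and each
$\beta_n \in \mathbb{Z}_{\ge 0}$. If $p$ is self-adjoint, then we can take
each $t_n \in \mathbb{R}$ and each $B_n$ to be self-adjoint.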

Proof. The Poincaré-Birkhoff-Witt theorem provides a vector space isomorphism

$$\Phi : S(L) \to U(L)$$

from the symmetric algebra to the universal enveloping algebra of any Lie
algebra $L$ [?]. The map $\Phi$ is given by symmetrization:

$$\Phi(X_1 X_2 \cdots X_n) = \frac{1}{n!} \sum_{\pi \in S_n} X_{\pi(1)} X_{\pi(2)} \cdots X_{\pi(n)}$$

for any $X_1, X_2, \ldots, X_n \in L$. If $L$ has an anti-involution $*$, then
it extends to both $S(L)$ and $U(L)$ and $\Phi$ intertwines it. The map
$\Phi$ also preserves powers of elements of $L$:

$$\Phi(X^n) = X^n.$$

It therefore also preserves linear combinations of powers.

We claim that if $L$ is a vector space, then every element of the symmetric
algebra $S(L)$ is a linear combination of powers. Lemma 5 establishes the
special case that the product of two powers is a linear combination of
powers. The set $P$ of linear combinations of powers is therefore closed
under multiplication; it is a (complex) subalgebra of $S(L)$. Since
$L \subset P$ trivially and $L$ generates $S(L)$, we conclude that
$P = S(L)$. If $L$ has an anti-linear $*$-involution, then the same argument
applies to $S(L)_{sa}$, which is generated as a real algebra by the real
vector space $L_{sa}$.

In conclusion, for any Lie algebra $L$, powers of elements of $L$ span $U(L)$
as a complex vector space. If $L$ has an anti-involution $*$, then powers of
elements of $L_{sa}$ span $U(L)_{sa}$ as a real vector space. The lemma is
the special case

$$L = \mathbb{C}\{A_1, A_2, \ldots, A_a\}.$$

□



The proof of Theorem 1 is simpler if the vector of Lie elements $\vec{B}$
provided by Lemma 6 is linear in $\vec{A}$. (This includes many interesting
choices for $p$, for instance the anti-commutator $A_1 A_2 + A_2 A_1$.)
Indeed in this case

$$p(\vec{A}) = \vec{t} \cdot \vec{B}^{\vec{\beta}}$$

is a polynomial in $\vec{B}$, so we could replace $\vec{A}$ with $\vec{B}$ in
the statement of the theorem.
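Two small instances may help fix ideas about the decomposition into powers of Lie elements. They are worked out here for illustration and can be verified by expanding the squares; they are not taken from the paper.

```latex
\begin{align*}
A_1A_2 + A_2A_1 &= (A_1 + A_2)^2 - A_1^2 - A_2^2, \\
A_1A_2 &= \tfrac12 (A_1 + A_2)^2 - \tfrac12 A_1^2 - \tfrac12 A_2^2
          + \tfrac12 [A_1, A_2].
\end{align*}
```

In the first line every Lie element ($A_1 + A_2$, $A_1$, $A_2$) is linear in $\vec{A}$, which is the simpler case described above. In the second line the extra Lie element $[A_1, A_2]$ has degree 2 and enters with exponent 1, so this $p$ falls under the general case.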

Lemma 7. If $A, B \in \mathcal{A}_{sa}$ have mean 0 and
$\alpha, \beta \in \mathbb{Z}_{\ge 0}$, then

$$\bigl\|[\tilde{A}^\alpha, \tilde{B}^\beta]\bigr\|_{\rho^{\otimes N}} = O(N^{-1/2})\, \|A\|^\alpha \|B\|^\beta.$$

Before proving Lemma 7 in full generality, we motivate it with a simple proof
when $\alpha = \beta = 1$. In this case

$$[\tilde{A}, \tilde{B}] = \frac{\widetilde{[A,B]}}{\sqrt{N}}.$$

Since $[A,B]$ has mean 0 (because $\rho$ is tracial), the typical eigenvalue
of $\widetilde{[A,B]}$ is $O(1)$. Thus the norm of the right side is
$O(N^{-1/2})$.
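In the case $\alpha = \beta = 1$ the estimate can be computed exactly. The following short derivation is a sketch under the hypotheses of Lemma 7, with $C = [A,B]$ as shorthand introduced here; it uses that copies of $A$ and $B$ in different tensor slots commute and that $\rho(C) = 0$ for tracial $\rho$.

```latex
\begin{align*}
[\tilde A, \tilde B]
  &= \frac{1}{N}\sum_{j,k=1}^{N} [A^{(j)}, B^{(k)}]
   = \frac{1}{N}\sum_{k=1}^{N} [A,B]^{(k)}
   = \frac{1}{\sqrt N}\,\widetilde{[A,B]}, \\
\bigl\|[\tilde A, \tilde B]\bigr\|_{\rho^{\otimes N}}^2
  &= \frac{1}{N^2}\sum_{j,k=1}^{N} \rho^{\otimes N}\bigl(C^{(j)*}C^{(k)}\bigr)
   = \frac{1}{N^2}\Bigl(N\,\rho(C^*C) + N(N-1)\,|\rho(C)|^2\Bigr)
   = \frac{\|C\|_\rho^2}{N}.
\end{align*}
```

So $\|[\tilde A, \tilde B]\|_{\rho^{\otimes N}} = N^{-1/2}\,\|[A,B]\|_\rho$ exactly, and $\|[A,B]\|_\rho \le 2\|A\|\,\|B\|$ recovers the stated bound in this case.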

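Proof. In brief, since

$$\rho(A) = \rho(B) = \rho([A,B]) = 0,$$

and since $\rho$ is tracial, the multilinear expansion of

$$\rho^{\otimes N}\bigl([\tilde{A}^\alpha, \tilde{B}^\beta]^2\bigr) = -\bigl\|[\tilde{A}^\alpha, \tilde{B}^\beta]\bigr\|_{\rho^{\otimes N}}^2$$

has $O(N^{\alpha+\beta-1})$ non-cancelling terms. Since the expansion also
has a factor of $N^{\alpha+\beta}$ in the denominator, the square of the norm
is $O(N^{-1})$.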

In detail, the expansion is

$$N^{\alpha+\beta}\, \rho^{\otimes N}\bigl([\tilde{A}^\alpha, \tilde{B}^\beta]^2\bigr) = \sum_{\vec{X},\vec{Y},\vec{Z},\vec{W}} \Bigl( \prod_n \rho(X_n Y_n Z_n W_n) - \prod_n \rho(Y_n X_n Z_n W_n) - \prod_n \rho(X_n Y_n W_n Z_n) + \prod_n \rho(Y_n X_n W_n Z_n) \Bigr), \tag{3}$$

where each vector $\vec{X}$ and $\vec{Z}$ consists of $\alpha$ copies of $A$
and $N - \alpha$ copies of $I$, and each vector $\vec{Y}$ and $\vec{W}$
consists of $\beta$ copies of $B$ and $N - \beta$ copies of $I$. Momentarily
fix $\vec{X}, \vec{Y}, \vec{Z}, \vec{W}$ and consider the corresponding four
terms in equation (3). If for some $n$, exactly one of $X_n, Y_n, Z_n, W_n$
is $A$ or $B$, then all four terms vanish. Likewise if for all $n$, not all
four of $X_n, Y_n, Z_n, W_n$ are $A$ or $B$, then the four terms cancel. Thus
in a non-cancelling choice of the vectors
$\vec{X}, \vec{Y}, \vec{Z}, \vec{W}$, the four components
$X_n, Y_n, Z_n, W_n$ are non-trivial for at most $\alpha + \beta - 1$ values
of $n$. There are only $O(N^{\alpha+\beta-1})$ such terms, each bounded by
$\|A\|^{2\alpha} \|B\|^{2\beta}$. □



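Applying Theorem 2 to $\vec{B}$, we learn not only that

$$\lim_{N\to\infty} \mathcal{L}[\tilde{\vec{B}}] = \mathcal{L}[\vec{Y}]$$

with $\vec{Y} = (Y_1, Y_2, \ldots, Y_b)$ Gaussian, but also that

$$\lim_{N\to\infty} \mathcal{L}[\tilde{\vec{B}}^{\vec{\beta}}] = \mathcal{L}[\vec{Y}^{\vec{\beta}}].$$

Thus for every $\vec{t}$, the ordered characteristic function

$$\hat{\mathcal{L}}[\tilde{\vec{B}}^{\vec{\beta}}](\vec{t}) = \rho^{\otimes N}\bigl(e^{i t_1 \tilde{B}_1^{\beta_1}} e^{i t_2 \tilde{B}_2^{\beta_2}} \cdots e^{i t_b \tilde{B}_b^{\beta_b}}\bigr) \to \hat{\mathcal{L}}[\vec{Y}^{\vec{\beta}}](\vec{t}) \tag{4}$$

approaches a classical limit as $N \to \infty$. Combining Lemma 4 with
Lemma 7,

$$\bigl\|e^{i t_1 \tilde{B}_1^{\beta_1}} e^{i t_2 \tilde{B}_2^{\beta_2}} \cdots e^{i t_b \tilde{B}_b^{\beta_b}} - e^{i \vec{t} \cdot \tilde{\vec{B}}^{\vec{\beta}}}\bigr\|_{\rho^{\otimes N}} = O(N^{-1/2}). \tag{5}$$

Finally combining equations (4) and (5) with

$$\|\rho^{\otimes N}\|_{\rho^{\otimes N}} = 1,$$

we obtain

$$\lim_{N\to\infty} \rho^{\otimes N}\bigl(e^{i \vec{t} \cdot \tilde{\vec{B}}^{\vec{\beta}}}\bigr) = E\bigl[e^{i \vec{t} \cdot \vec{Y}^{\vec{\beta}}}\bigr].$$

Replacing $\vec{t}$ by $z\vec{t}$, we obtain

$$\lim_{N\to\infty} \rho^{\otimes N}\bigl(e^{i z\, \vec{t} \cdot \tilde{\vec{B}}^{\vec{\beta}}}\bigr) = E\bigl[e^{i z\, \vec{t} \cdot \vec{Y}^{\vec{\beta}}}\bigr].$$

Since $p(\vec{X}) = \vec{t} \cdot \vec{Y}^{\vec{\beta}}$ and pointwise
convergence of characteristic functions implies weak convergence of measures,
we have established Theorem 1 when $\vec{B}$ is linear.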

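The idea behind the general case is that the non-linear terms in each
$\tilde{B}_n$ decay as $N \to \infty$. We expand each $B_n$ as a sum of
homogeneous terms:

$$B_n = B_{n,1} + B_{n,2} + \cdots + B_{n,d_n},$$

where $B_{n,d}$ is the degree $d$ term of $B_n$. Then

$$p(\tilde{A}_1, \tilde{A}_2, \ldots, \tilde{A}_a) = t_1 \tilde{C}_1^{\beta_1} + t_2 \tilde{C}_2^{\beta_2} + \cdots + t_b \tilde{C}_b^{\beta_b},$$

where

$$C_n = B_{n,1} + N^{-1/2} B_{n,2} + \cdots + N^{(1-d_n)/2} B_{n,d_n}.$$

Furthermore

$$\rho(B_{n,1}) = \rho(B_{n,2}) = 0,$$

in the first case because $B_{n,1}$ is linear, and in the second case because
$B_{n,2}$ is a commutator and $\rho$ is tracial. It follows that

$$\lim_{N\to\infty} \rho(C_n) = 0.$$

Lemma 8.

$$\lim_{N\to\infty} \mathcal{L}[\tilde{\vec{C}}] = \mathcal{L}[\vec{X}],$$

where $\vec{X}$ is Gaussian with covariance matrix

$$M_{j,k} = E[X_j X_k] = \rho(B_{j,1} B_{k,1}).$$

In other words,

$$\mathcal{L}[\tilde{C}_1, \tilde{C}_2, \ldots, \tilde{C}_b]$$

approaches the same classical limit as

$$\mathcal{L}[\tilde{B}_{1,1}, \tilde{B}_{2,1}, \ldots, \tilde{B}_{b,1}].$$

Proof. Observe that

$$\lim_{N\to\infty} C_n = B_{n,1}.$$

The lemma follows from the proof of Theorem 2, where it was left unstated
that all estimates are locally uniform in $\vec{A}$. We substitute
$\vec{C} - \rho(\vec{C})$ for $\vec{A}$ in the theorem and use the fact that
$\rho(C_n) \to 0$. □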

The rest of the proof of Theorem 1 follows the same argument as the case when
$\vec{B}$ is linear by substituting $\vec{C}$ for $\vec{B}$.

4. NOT THE PROOF

To understand the proof of Theorem 1, it may help to see why some alternative
lines of argument do not suffice.

The Giri-von-Waldenfels central limit theorem implies that the moments of
$p(\tilde{\vec{A}})$ converge to the moments of $p(\vec{X})$ by multilinear
expansion. However, unless $p$ is either linear or positive-definite
quadratic, $p(\vec{X})$ is not uniquely determined by its moments because the
tail of $\mathcal{L}[p(\vec{X})]$ is too thick. It is nonetheless possible
that Theorem 1 would follow from the Giri-von-Waldenfels theorem together
with an analytic theory of non-commutative moments.

The Goderis-Verbeure-Vets central limit theorem establishes the convergence
of any

$$\rho^{\otimes N}\bigl(e^{i \vec{t} \cdot \tilde{\vec{A}}}\bigr),$$

which can be interpreted as a joint characteristic function of $\vec{A}$.
Indeed Lemma 8 more generally establishes the convergence of

$$\rho^{\otimes N}\bigl(e^{i \vec{t}_1 \cdot \tilde{\vec{A}}} e^{i \vec{t}_2 \cdot \tilde{\vec{A}}} \cdots e^{i \vec{t}_b \cdot \tilde{\vec{A}}}\bigr)$$

for any sequence of vectors $\vec{t}_1, \vec{t}_2, \ldots, \vec{t}_b$.
However, absent an analytic theory of non-commutative characteristic
functions, this does not imply Theorem 1.

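The Baker-Campbell-Hausdorff expansion also shows that

$$e^{i t_1 \tilde{B}_1^{\beta_1}} e^{i t_2 \tilde{B}_2^{\beta_2}} \cdots e^{i t_b \tilde{B}_b^{\beta_b}}$$

approximates

$$e^{i \vec{t} \cdot \tilde{\vec{B}}^{\vec{\beta}}}.$$

Unfortunately the infinite sum of the BCH expansion does not (as far as we
know) commute with the limit $N \to \infty$.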

If $\rho$ is not tracial, the proof of Lemma 4 fails for the non-spectral
norm

$$\|A\|_\rho = \sqrt{\rho(A^*A)}.$$

Alternatively, if $\sigma$ is a tracial state, then

$$\|\rho^{\otimes N}\|_{\sigma^{\otimes N}} = \|\rho\|_\sigma^N \to \infty$$

exponentially, assuming that $\mathcal{A}$ even has a tracial state and that
$\|\rho\|_\sigma$ is finite. Lemma 7 generalizes to higher-order commutators
which decay more and more quickly, but these suffer from the same
exchange-of-limits problem as the BCH expansion when combined with the norm
of $\rho^{\otimes N}$. Theorem 2 also depends on the assumption that $\rho$
is tracial, but there the assumption may be unnecessary.

Finally when $\rho$ is not tracial, then $\|\cdot\|_\rho$ is at least
left-invariant under unitary multiplication. If each commutator $[A_j, A_k]$
is central, then we can move the unitary factors in Lemma 3 to the left and
consequently establish Lemma 4 for $\|\cdot\|_\rho$. This proves a slight
generalization of Theorem 1. However, it is equivalent to the
Cushen-Hudson-Quaegebeur central limit theorem [12], except that they do not
require bounded random variables.